Robust Imitation of Diverse Behaviors

نویسندگان

  • Ziyu Wang
  • Josh S. Merel
  • Scott E. Reed
  • Nando de Freitas
  • Gregory Wayne
  • Nicolas Heess
چکیده

Deep generative models have recently shown great promise in imitation learning for motor control. Given enough data, even supervised approaches can do one-shot imitation learning; however, they are vulnerable to cascading failures when the agent trajectory diverges from the demonstrations. Compared to purely supervised methods, Generative Adversarial Imitation Learning (GAIL) can learn more robust controllers from fewer demonstrations, but is inherently mode-seeking and more difficult to train. In this paper, we show how to combine the favourable aspects of these two approaches. The base of our model is a new type of variational autoencoder on demonstration trajectories that learns semantic policy embeddings. We show that these embeddings can be learned on a 9 DoF Jaco robot arm in reaching tasks, and then smoothly interpolated with a resulting smooth interpolation of reaching behavior. Leveraging these policy representations, we develop a new version of GAIL that (1) is much more robust than the purely-supervised controller, especially with few demonstrations, and (2) avoids mode collapse, capturing many diverse behaviors when GAIL on its own does not. We demonstrate our approach on learning diverse gaits from demonstration on a 2D biped and a 62 DoF 3D humanoid in the MuJoCo physics environment.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Evolution of Behaviors through Embodied Imitation

This article describes research in which embodied imitation and behavioral adaptation are investigated in collective robotics. We model social learning in artificial agents with real robots. The robots are able to observe and learn each others' movement patterns using their on-board sensors only, so that imitation is embodied. We show that the variations that arise from embodiment allow certain...

متن کامل

Imitation, simulation, and schizophrenia.

The social significance of imitation is that it provides internal tools for understanding the actions of others by simulating or forming internal representations of these actions. Imitation plays a central role in human social behavior by mediating diverse forms of social learning. However, imitation and simulation ability in schizophrenia has not been adequately addressed. The major aim of the...

متن کامل

Dolphin Imitation: Who, What, When, and Why?

The imitative ability of nonhuman animals has intrigued a number of scholars and, in doing so, has generated a considerable amount of controversy. Although it is clear that many species can learn via observational learning, there is a lack of consensus concerning both what sorts of things can be learned by watching others and what types of observational learning should count as imitation. These...

متن کامل

The Role of Affect in Imitation: an Epigenetic Robotics Approach

In animals, humans and robots, imitative behaviors are very useful for acting, learning and communicating. Implementing imitation in autonomous robots is still a challenge and one of the main problems is to make them choose when and who to imitate. We start from minimalist architectures, following a bottom-up approach, to progressively complete them. Based on imitation processes in nature, many...

متن کامل

Is Bayesian Imitation Learning the Route to Believable Gamebots?

As it strives to imitate observably successful actions, imitation learning allows for a quick acquisition of proven behaviors. Recent work from psychology and robotics suggests that Bayesian probability theory provides a mathematical framework for imitation learning. In this paper, we investigate the use of Bayesian imitation learning in realizing more life-like computer game characters. Follow...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017